- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources2
- Resource Type
-
0001000001000000
- More
- Availability
-
20
- Author / Contributor
- Filter by Author / Creator
-
-
Horowitz, Mark (2)
-
Liu, Qiaoyi (2)
-
Sreedhar, Kavya (2)
-
Strange, Maxwell (2)
-
Barrett, Clark (1)
-
Bhagdikar, Nikhil (1)
-
Carsello, Alex (1)
-
Chen, Po-Han (1)
-
D’Agostino, Brandon (1)
-
Feng, Kathleen (1)
-
Joshi, Pranil (1)
-
Kong, Taeyoung (1)
-
Koul, Kalhan (1)
-
Mann, Makai (1)
-
Melchert, Jackson (1)
-
Myers, Zach A. (1)
-
Nayak, Ankita (1)
-
Nyengele, Gedeon (1)
-
Raina, Priyanka (1)
-
Richardson, Stephen (1)
-
- Filter by Editor
-
-
Piskac, Ruzica (1)
-
Whalen, Michael W. (1)
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Amber is a system-on-chip (SoC) with a coarse-grained reconfigurable array (CGRA) for acceleration of dense linear algebra applications, such as machine learning (ML), image processing, and computer vision. It is designed using an agile accelerator-compiler co-design flow; the compiler updates automatically with hardware changes, enabling continuous application-level evaluation of the hardware-software system. To increase hardware utilization and minimize reconfigurability overhead, Amber features the following: 1) dynamic partial reconfiguration (DPR) of the CGRA for higher resource utilization by allowing fast switching between applications and partitioning resources between simultaneous applications; 2) streaming memory controllers supporting affine access patterns for efficient mapping of dense linear algebra; and 3) low-overhead transcendental and complex arithmetic operations. The physical design of Amber features a unique clock distribution method and timing methodology to efficiently layout its hierarchical and tile-based design. Amber achieves a peak energy efficiency of 538 INT16 GOPS/W and 483 BFloat16 GFLOPS/W. Compared with a CPU, a GPU, and a field-programmable gate array (FPGA), Amber has up to 3902x, 152x, and 107x better energy-delay product (EDP), respectively.more » « less
-
Tsiskaridze, Nestan; Strange, Maxwell; Mann, Makai; Sreedhar, Kavya; Liu, Qiaoyi; Horowitz, Mark; Barrett, Clark (, Proceedings of the 21st International Conference on Formal Methods In Computer-Aided Design (FMCAD '21))Piskac, Ruzica; Whalen, Michael W. (Ed.)The increasing complexity of modern configurable systems makes it critical to improve the level of automation in the process of system configuration. Such automation can also improve the agility of the development cycle, allowing for rapid and automated integration of decoupled workflows. In this paper, we present a new framework for automated configuration of systems representable as state machines. The framework leverages model checking and satisfiability modulo theories (SMT) and can be applied to any application domain representable using SMT formulas. Our approach can also be applied modularly, improving its scalability. Furthermore, we show how optimization can be used to produce configurations that are best according to some metric and also more likely to be understandable to humans. We showcase this framework and its flexibility by using it to configure a CGRA memory tile for various image processing applications.more » « less
An official website of the United States government
